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Abstract: The CNS needs to be understood at four nearly independent levels of description: 
(1) that at which the nature of a computation is expressed; (2) that at which the algorithms 
that implement a computation are characterized; (3) that at which an algorithm is committed 
to particular mechanisms; and (4) that at which the mechanisms are realized in hardware. 
In general, the nature of a computation is determined by the problem to be solved, the 
mechanisms that are used depend upon the available hardware, and the particular 
algorithms chosen depend on the problem and on the available mechanisms. Examples are 
given of theories at each level. 
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From understanding computation to understanding neural circuitry 

Complex systems, like a nervous system or a developing embryo, must be 
analyzed and understood at several different levels. Of course, there are logical and causal 
relationships among them, but the important point is that these levels of description are only 
loosely related. The underlying philosophical issue here is that reductionism does not imply 
constructionism. 

Thus the question we ask in this essay is, at what level is it necessary and 
profitable to study the information processing that is carried out during visual perception? 
For a machine that solves an information processing problem, there are four important 
levels of description. At the lowest, there is basic component and circuit analysis - how do 
transistors, neurons, diodes, and synapses work? The second level is the study of particular 
mechanisms; adders, multipliers and memories accessed by address or by content The third 
level is that of the algorithm, and the top level contains the theory of the overall 
computation. For example, take the case of Fourier analysis. The theory of the Fourier 
transform is well understood, and is expressed independently of the particular way in which 
it is computed. One level down, there are several algorithms for implementing a Fourier 
transform - the Fast Fourier transform (Cooley & Tukey 1965) which is a serial algorithm; 
and the parallel algorithms of holography that are based on mechanisms of laser optics. All 
these algorithms carry out the same computation, and the choice of which one to use 
depends upon the particular mechanisms that are available. If one has fast digital memory, 
adders and multipliers, one will use the FFT, and if one has a laser and photographic plates! 
one will use an optical algorithm. In general, mechanisms are strongly determined by 
hardware, the nature of the computation is determined by the problem, and the algorithms 
are determined by the computation and the available mechanisms. 

Each level of description has its place in the eventual understanding of 
perceptual information processing, and it is important to keep them separate. Too often in 
attempts to relate psychophysical problems to physiology there is confusion about the level at 
which a problem arises - is it related mainly to biophysics (like after-images) or primarily to 
information processing (like the ambiguity of the pecker cube)? More disturbingly, 
although the top level is the m0$t neglected, it is also the most important. This is because 
the structure of the computations that underly perception depend more upon the 
computational problems that have to be solved than on the particular hardware in which 
their solutions are implemented. There is an analog of this in physics, where the global 
descriptions of thermodynamics represented, at least historically, the first stage in the study 
of matter. A description in terms of mechanisms or elementary components appeared 
afterwards. rr 

Our main point then is that the topmost of our four levels, that at which 
the necessary structure of computation is defined, is a crucial but neglected one. Its study is 
separate from the study of particular algorithms, mechanisms or hardware, and the 
techniques needed to pursue it are new. In the rest of the article, we summarize some 
examples of theories at the different levels we described, and illustrate the types of 
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Figure 1. la shows the distributions of the error angle #ft during stationary fixation of 2- 
stripe patterns (lower figure). The varying parameter is the angular separation of the two 
black, vertical stripes. The upper figure shows corresponding histograms obtained from eq. 
(I). A typical phase transition occurs in the stationary fixation distribution for a value of 
the parameter between 40 and 60 degrees. The nonlinear superposition of very simple, local 
mechanisms (see Fig. 5b) leads to such a symmetry breaking. An open loop analysis (for 
instance via electophysiology) cannot predict this closed loop behavior without the 
phenomenological theory. (From Reichardt and Poggio, 1976). The same phase transition 
behavior can be observed in the fixation of the Mueller-Lyer figures (fig lb) The 
histograms extend from -180 to 180 degrees and show (he fraction of time the fly gazed at 
any part of the two figures. T|ie results conform to the theory's predictions (from Geiger 
and Poggio, 1975). A fly fixates (in the horizontal degree of freedom) the -illusory" vertical 
line arising at the boundary between the two sets of parallel horizontal lines in the upper 
pattern (fig. lc). There is no fixation, however, of a similar illusory, vertical line in the 
lower pattern. The phenomenological theory correctly predicts these results. (From Porcio 
and Geiger, unpublished). • ^ 6 
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Figure 2. The geometry of constraints on the computation of binocular disparity. 2a 
illustrates the constraints for the case of a one-dimensional image. Lx and Ly represent the 
positions of descriptive elements from the left and right views, and the horizontal and 
vertical lines indicate the range of disparity values that can be assigned to left-eye and 
right-eye elements. The use-once condition states that only one disparity value may be 
assigned to each descriptive element That is, only one disparity value may be "on" along 
each horizontal or vertical line. The second condition states that we seek solutions in which 
disparity values vary smoothly almost everywhere. That is, solutions tend to spread along 
the dotted diagonals, which are lines of constant disparity, and between adjacent diagonals. 
2b shows how this geometry appears at each intersection point. The constraints may be 
implemented by a network with positive and negative interactions that obey this geometry, 
because the stable states of such a network are precisely the states that satisfy the constraints 
on the computation. 2c shows the constraint geometry for a 2-dimensional image. The 
negative interactions remain essentially unchanged, but the positive ones now extend over a 
small 2-dimensional neighbourhood. A network with this geometry was used to perform the 
computation exhibited in figure 6. 
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prediction that can emerge from each. 

Examples of computational theories 

1: Orientation behavior of the fly (Reichardt, 1970; Reichardt & Poggio, 1976) 

The approach of Reichardt and Poggio towards an understanding of the 
visual control system of the fly is an example of the characterization of a "simple" complex 
system. Their description accounts in a quantitative way for orientation, chasing and 
spontaneous pattern preference behavior. Although connections with physiological and 
anatomical data are being established, the theory was based on behavioral data. The theory 
outlines the basic logical organization of the visual control system of the fly. It holds that 
the nervous system performs two main computations on the visual input, one extracting 
movement information, the other providing position information. The theory leads to the 
equation of motion #- 7 

ejf(t) ♦ k^(t) ♦ k«(t) - -Dty(t)] - r^(t) ♦ N(t) (1) 

T^r ^J^^JF" * {t) re P rcsent$ the instantaneous position of the pattern on the 
retina of the fly. The terms on the left hand side represent the flight dynamics (8 is the 

E!Tif xl rt \u l u C *?' V s * r0taU0nal friction con$tant) - «W is the angular speed of 
ine ooject. The right hand side describes the instantaneous torque of the fly: the term N(t) 
is a zero-mean random process, and is independent of the visual input; ri(t), a velocity- 
dependent optomotor response, is the result of "movement computation"; DM) carries the 
position information, acquired from the visual input by the "position computation". All 
Uiese terms have been characterized quantitatively, through independent experiments. 
Through equation (1) the theory predicts a rather complex natural (closed loop) behavior 
characterized by phase transition-like phenomena, and by primitive classifications of 
patterns Figure 1 gives two examples of behavior that is quantitatively explained by this 
approach The quantitative description of equation (1) could not have been obtained from 
single cell recordings or from histology. Furthermore, it is probably a prerequisite of any 
full understanding at the level of circuitry. ^ ' 

. t ' Th * othcr *xamp1« that we describe come from work on visual 

information processing that has been carried out at the M. I. T. Artificial Intelligence 
Laboratory over the last two years, * 

2: StAeopsis (Marr 1974, Marr & Poggio in preparation) I 

: k ' Suppose that images of a scene are available, taken from two nearby 

ffi^J K- Same ^° riZOn ^ ,CVe,J In ° rdCr * C<im P UtC ster *°*°Pfc disparity, the following 
steps must be carried out: (1) a particular location on a surface in the scene must be chosen 
from one image; (2) that location must be identified in the other image; (3) the relative 
positions of the two images of that location must be measured. Notice that methods based 
60 grey-level correlation between images fail to satisfy these conditions because a grey-level 
measurement does not define a point on a physical surface independently of the geometry of 
the imaging deyice The matching must be based on objective markings that lie on a 
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physical surface, and so one hat to Use predicates that correspond to changes in reflectance 
One way of doing this is to obtain a primitive description of the intensity changes that exist 
in each image, and then to match these descriptions. Line and edge segments, blobs, and 
edge termination points correspond quite closely to boundaries and reflectance changes on 
physical surfaces. i e 

mlm .«„. -. , ., Th * $ter /° pr0ble ! m may thu$ be reduced t0 that ° r matching two 
primitive descriptions, one from each eye. One can think of elements of these descriptions 
as having only position information like the black points in a random-dot stereogram 
although in practise there exist some rules about which matches between descriptive 
elements are possible, and which are U. There are two crucial constraints on the way in 
which the left and right descriptions are combined: 7 

(1) Thi usrmo conditim. Each item from each image may be assigned at most one disparity 
value. This condition rests on the premise that the items to be matched have a physical 
existence, and can be in only one place at a time. 

(2) Continuity. Disparity varies smoothly almost everywhere. This condition is a 
consequence of the cohesiveness of matter, and it states that only a relatively small fraction 
or the area of an image is composed of boundaries. ' 

These conditions on the computation are represented geometrically in figure 2a. Later in 
the article we exhibit a network that implements these conditions, and we illustrate how it 
solves random-dot stereograms. 

3: Tht Primal Skttck (Marr 1976a) 

It is a commonplace that a scene and a drawing of the scene appear very 
similar, despite the completely different grey-level images to which they give rise This 
suggests that the artist's local symbols correspond in some way to natural symbols that are 
computed ourof the image during the normal course of; in Interpretation. The first part of 
h.s visual information theory asiru .hat the first operation; on an image is to transform . 
n« its raw primal s^tch, whic,h i, .primitive but rtiii dvcriptUm of the intensity ch»£« 
that are present. Figure S shows, an example. In order to obtain this description, 
approximations to the first and second directional derivatives of intensity are measured a 
ttZZT, °7 and I ° n ^ ** ^herein' *• image, and the* me.suran.nu 
KrTSw^, **&*&* ■""""ti TI ?«Prt«« of computing the primal 
iketch involves! fiva important steps, the first of which can be compared with the 
hWsurements that are apparently mad, by simple cells in the visual cortexl One prediction 

£™!tJL F" *' ,h * ry ^ T* " 8 ' Ven mten,it * ehan 8 e "* lf Amines which 
Simpte^ll measurements are used » describe it This is in direct contrast to theories which 
issert that eac» jjsimple cell acts as a 1 >ature-detector". whose output is freely available to 
b!ffllEu^ t An0,her Pr ^" p,0n * that ' w «H-d«nned interaction must take place 

vS^L^T^ ^r me, ' Urem0,B mad * " the *"■ ««««tai and posiuon in the 
vijual field but with different receptive field sizes. 

4: Cfmping an£ texture vision (Marr 1976a) 

I The primal sketch of an image is in general a large and unwieldy 



Marr & Poggio 



From computation to neural circuitry 



Figure 3. 3a shows the image of a toy bear, printed in a font! with 16 grey levels. In 3b, the 
intensity at each point is represented along the z-axis. 3c illustrates the spatial component of 
the raw primal sketch as obtained frofa this image. Associated with each line segment are 
measures of contrast, type and extent oY the intensity change, position and orientation. This 
image is so simple that purely local grouping processes suffice to extract the major forms 



from the primal sketch. These forms 
21). 



are exhibited in 3d, e 
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collection of d^ta. This is an unavoidable consequence of the irregularity and complexity of 
natural images; The next important computational problem is how to decode the primal 
sketch. For mjwt images, it is unnecessary to invoke specific hypotheses about what is there 
until considerably later in the proctsping. Th^ theory next applies a number of quite 
general selection and grouping processes to elements in the primal sketch. The purpose of 
these processes is to organize the local descriptive elements into forms and regions, which are 
closed contour! groups that are obtained in various ways. Regions may be defined by their 
boundaries, which have been formed bfy grouping together some set of edge, line, or place- 
tokens; or thev may be defined by a first-order: predicate operating on the primal sketch 
elements within it. This second method corresponds to the definition of a region by a 
texture, and itj leads to a theory of the processes oh which texture discrimination is based. 

! It is important to realize that the descriptive items that may be grouped 
here can be very abstract - like tokens for the end of a line, a blob, or a constructed line 
that joins two! blobs. Tokens are created for each new group, and these tokens themselves 
become subject to the operation of the same or similar grouping processes as operated on 
elements of tjie raw primal sketch. The grouping processes are very conservative. They 
satisfy a principle that seems to have general application to recognition problems, called the 
prtnctple of least commttment. Only "obvious" groupings are made, and where there is doubt 
between two possible groupings, both are constructed and held pending subsequent selection. 
Figure 3 illustrates some results of applying these grouping processes. 

5: 3-D representation of shape (Marr & Nishihara 1975) 

The last two components of the theory concern the representation of 
three-dimensional shapes. One component deals with the nature of the representation 
system that is used, and the other with how to obtain it from the types of description that 
can be delivered from the primal sketch. The key ingredients of the representation system 
are: 

(a) The deep structure of the three-dimensional representation of an object consists of a 
stick figure, where in formal terms each stick represents one or more axes in the object's 
generalized cylinder representation. In fact, a hierarchy of stick figures exists, that allows 
one to describe an object on various scales with varying degrees of detail. 

(b) Each stick figure is defined by a propositional database called a 3-D model. The 
geometrical structure of a 3-D model is specified by storing the relative orientations of pairs 
of connecting axes. This specification is local rather than global, and it contrasts with 
schemes in which the position of each axis is specified in isolation, using some 
circumscribing frame of reference. 

(c) When a 3-D model is being used to interpret an image, the geometrical relationships in 
the model are interpreted by an essentially analogue mechanism called the image-space 
processor. This mechanism is computationally very simple, and may be thought of as a 
limited resolution device for representing the positions of two vectors in 3-space, and for 
computing their projections onto the image. 

(d) During recognition, a sophisticated interaction takes place between the image, the 3-D 
model, and the image-space processor. This interaction gradually relaxes the stored 3-D 
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model onto the axes computed from the image. Some facets of this process resemble the 
computation of a S-D rotation, but a simple computer graphics metaphor is misleading. In 
fact, the rotations take place on abstract vectors (the axes) that are not even present in the 
original imagje; and at any moment, only two such vectors are explicitly represented. 

The essence of this part of the theory is a method for representing the 
spatial disposition of the parts of an object and their relation to the viewer. 

6: 2 1/2 - dimensional analysis of an image (Marr & Vatan, in preparation) 

In simple images, the forms delivered from the primal sketch correspond 
to the contours of physical objects. Finally therefore, we need to bridge the gap between 
such forms and the beginning of the 3-D analysis described in the previous paragraph. We 
call this 2 1/2 - dimensional analysis, and it consists largely of assigning to contours labels, 
that reflect aspects of their 3-dimensional configuration, before that configuration has been 
made explicit. The most powerful single idea here is the distinction between convex and 
concave edges and contour segments. One can show that these distinctions are preserved by 
orthogonal projections, and can be made the basis of a segmenting technique that 
decomposes a figure into 2-D regions that correspond to the appropriate 3-D decomposition 
for a wide range of viewing angles (see figure 4). The theory assigns many alternating 
figure effects like the Necker cube to the existence of alternative self-consistent labelling* 
computed at this stage. 

It is perhaps worth mentioning one interesting point that has emerged 
from this way of recognizing and representing 3-D shapes. Warrington & Taylor (1973) 
described patients with right parietal lesions who had difficulty in recognizing objects seen 
in "unconventional" views - like the view of a water pail seen from above. They did not 
attempt to define which views are unconventional. According to our theory, the most 
troublesome yiews of an object will be those in which Its stitk-figure axes cannot easily be 
reared frOrnJhe image. The theory therefore presets that unconventional views in the 
Warrington & Taylor sense: will correspond to those vie^s in| which an important axis in the 
obje^s generalized cylinder represehtjation is foreshortened. Such views aW by no means 
uncommon - if a 3$mm camera is directed towards yoi, you are seeing an unconventional 
view^of it, since the axis of its lens is foreshortened. Recent observations by S. Carey 
(personal communication) appear to confirm that this definition captures the important 
distinction. ! r 
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Examples of algorithms and mechanisms 

Between the top and bottojn of our four levels lie descriptions of 
algorithms and* descriptions ^mechanisms. The distinction between these two levels is 
rather subtle, since they arej often Closely relate*. The form of a specific algorithm can 
impose strong constraints on the mechanism^, and conversely. Let us consider three 
examples. ; | ] r ; 
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Figure 4. Analysis of a contour. The outline (4a) was obtained from a primal sketch just as 
figure 3d was obtained from 3a. This contour is smoothed, and then divided into convex 
and concave components (3b). The! outline is searched for deeply concave points or 
segments, which correspond to main segmentation points. There are usually several possible 
matches for each such segmentation pbint, but the correct mates for each may be found by 
eliminating relatively poor candidate The result, of this is the segmentation shown in 3c 
Once these segments have been delink corresrioridingl axes! ire easy to obtain (3d)l They 
do not usually connect, but may be related to one another b|^ intermediate lines which are 
called embedding relations (8e), TrW resulting s iik figure is shown in 3f which, according 
to the theory, is the deep structure oh Uich interpretation of this image is based. 
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Figure 5. Graphical representation (a) of the decomposition of a nonlinear, n-input 
"algorithnTinto a sum of interactions of various order. The functional representation 

S{.. Xi (t)..} - L (o) * J l!? {«!«)♦ I tf IK S <lXXj (0] ♦ ... 
where L n 'is an n-linear mapping, 'can be read from an appropriate sequence of such 
elementary graphs. Fig. 5b shows the graphs that implement the fly's orientation behavior, 
studied by Reichardt and Poggio. Several findings suggest that they may correspond to 
separate physiological modules. Characteristic functional and computational properties can 
be associated to each interaction type. (From Poggio and Reichardt, 1976). 
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/: "Simple" algorithms 

An algorithm operates on some kind of input and yields a corresponding 
output In formal terms, an algorithm can be thought of as a mapping between the input 
and the output space. Perhaps the simplest of all nonlinear operators on a linear space are 
the so-called polynomial operators. They encompass a broad spectrum of applications 
including all linear problems, and they approximate all sufficiently smooth, nonlinear 
operators. For this particular class of "simple" algorithms (i.e. representable through a 
"smooth" operator) polynomial representations provide a canonical decomposition in a series 
of simpler, multilinear operators. Figure 5 shows this decomposition in terms of interactions 
or "graphs" of various orders: in this way an algorithm, or its network implementation, may 
be decomposed into an additive sequence of simple, canonical terms, just as in another 
context, a function can be conveniently characterized by its various Fourier terms. 
Moreover, functional and computational properties can be associated with interactions of a 
given order and type. 

Poggio & Reichardt (1976) used the polynomial representation of 
functionals to classify the algorithms underlying movement, position and figure-ground 
computation in the fly's visual system. The idea was to identify which terms, among the 
diversity of the possible ones, are implied by the experimental data. Figure 5 shows the 
graphs that play a significant role in the fly's control of flight and, in this sense, 
characterize the algorithms involved. The notion that seems to capture best the 
"computational complexity" of these simple, smooth mappings is the notion of p-order 
(perceptron-order, see Poggio and Reichardt, 1976). Movement computation in the fly is of 
order 2, and figure-ground discrimination in the simple case of relative motion depends on 
fourth-order graphs, but possibly with p-order 2. A closed or Type 1 (Marr 1976b) theory of 
this kind may be a useful way of characterizing preprocessing operations in nervous systems. 
The approach has a rather limited validity however, since it does not apply to the large and 
important class of "non-smooth" algorithms, where cooperative effects, decisions and symbols 
play an essential role^ Wrjile an arbitrary number of mechanisms and circujits may 
implement t^e "smooth" algorithms, it is clear that forward" interactions between neurons 
are the most natural candidates. . ;j ! . "',| ,. : :' !' |' , 
! |j Although the various levels of description are only loosely related, 

E" owieo^ge of tbe| computation and kif | the algorithm may sometimes admit inferences at the 
vest Wvel of a^atc|my and physiojj&gy. The description of the visual system of the fly at 
t computahortalj and functional level suggests,! for instance, that different, separate neural 
structures may correspond to the different computations. Recent data support this 
conjecture. Movement computation^ (tne term r^ft) of equation (1) and the second order 
gfaph| Of Figurt'5) seems todepend mainly on receptor system 1-6, while the position 
computation (the term Dtyr(t)) of equation (1) and the "self -graph" of Figure 5) seems 
dependent on receptor system 7-8 (>Vehr hah n, 1976 and in preparation). Mutants o^ 
Drosophila, normal with respect to the movement algorithm, are apparently disturbed in the 
position algorithm (Heisenberg, in preparation): 
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2: "Cooperative" algorithms 

A more general and not closely definable class of algorithms includes 
what one might call cooperative algorithms. Such algorithms may describe bifurcations and 
phase transitions in dynamical systems. An essential feature of a cooperative algorithm is 
that it operates on many "input" elements and reaches a global organization via local but 
highly interactive constraints. An apparently cooperative algorithm plays a major role in 
binocular depth perception (Julesz J971). The stereopsis computation defined by figure 2a 
applies many local constraints to mahy local inputs to yiild a final s{ate consistent with these 
constraints. Various mechanisms cbuld implement this type of algorithms. Parallel, 
recurrent, nonlinear interactions, both excitatory and inhibitory, se^rn to represent a natural 
implementation. In the stereopsis case such a mechanism is illustrated in the rest of figure 2. 
This mechanism may be realized through many different components and circuitries. In the 
nervous system, however, there are Certain very obvious candidates, which allow some 
definite predictions. For instance, o^ei is led to conjecture the existence of disparity columns 
(actually layers) of cells with reciprocal excitatory short-range interactions on each layer and 
long-range inhibitory interactions between layers with the characteristic "orthogonal" 
geometry of figure 2. Figure 6 shows that this algorithm succesfully extracts depth 
information from random-dot stereograms. The algorithm exhibits typical cooperative 
phenomena, like hysteresis and disorder-order transitions. It is important to stress that it is 
the computational problem which determines the structure of the excitatory and inhibitory 
interactions, and not "hardware" considerations about neurons or synapses. The apparent 
success of this cooperative algorithm in tackling the stereo problem suggests that other 
perceptual computations may be easy to implement in similar ways. Likely candidates are 
"f illing-in" phenomena, subjective contours, figural reinforcement, some kinds of perceptual 
grouping and associative retrieval. In fact the associative retrieval network described by 
Marr (1971), in connexion with a theory of the hippocampal cortex, implements a cooperative 
algorithm. 

3: Procedural algorithms I 

Still another ana larger class oif algorithms is represented by the 
specification cjf procedures, arid the construction ^idj manipulation of explicit symbolic 
descriptions, ^or example, the >EJ representation tl|^r| described in part 5 jbf the previous 
section explains how the stick fteif re representation of a viewed object may be obtained 
n-onL an imaie,; and manipulated jcWing recognition-! The detailed specification of trie 
a^go^itljims involved here lis carried out by defining the datastructures that |a^e created to 
represent the situation, and by specifying procedure* that operate on these datastructures in 
accordance w^th the information jcurrently being delivered from the ima^e, and that 
available from stored, models. ' 

|j |i * This *jay of spiic^fying an algjc-rit^im is very general and powerful, 

4fthoy|h uniik* the two other ways that we disqi$se<i, it is a far cry from the Circuitry level 
of description at W^ich jneurophysiblogical experiments are carried out. In a digital 
Combu^er, one does not try| to bridge the gap between these two levels in one step, Instead, a 
basi: instruction set, an assembler, a hjgh level language (LISP, ALGOL) and a compiler 
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Figure 6. A pair of random dot stereograms (left and right), the initial state of a network 
that implements the algorithm illustrated in figure 2, and the firtt 9 iterations of the 
network operating on this stereo pair. To understand how the figures represent states of the 
network, imagine looking down on it from ab6ve. The different disparity layers in the 
network are in parallel planes spread out horizontally, and the viewer is looking down 
through them. In each plane, some nodes are^ on and some are off. Each layer in the 
network has been assigned a different gray lev^l, so that a node that is switched on in the 
lowest layer contributes a dark poirit to the imige, and one that is switched on in the top 
layer contributes a lighter point Initially (iteration 0) thfe the network is disorganized, but in 
the final state order has been achieved (iteration 9). The central square has a convergent 
disparity of 2 relative to the background, and it therefore appears lighten The density of 
the original random dot stereogram was 50*, but thealgorithm succeeds in extracting 
disparity values at densities down to less than 51 Let CMlenote the state of a cell (either 
or 1) in the S-D array of fig.2b at the n-th iteration. Then the algorithm used here reads 

,jh ° sfjh) '^ 0(ijh) 'J h ^'J h '• 

where u (x) - if x < $, and - 1 otherwise, S(ijh) is a neighborhood of cell (ijh) on the 
same disparity layer; O(ijh) represents the neighborhood of cell (ijh) defined by the 
"orthogonal" directions shown in Fig. 2b. Excitation between disparity layers is also present 
in a diagonal direction orthogonal to the layers and decreases with increasing disparity 
"distances"; D(ijh) represents the corresponding neighborhood. 
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are interposed to ease the burden of passing from the description of a computation down to 
the specification of a particular pattern of current flow. 

We may eventually expect a similar intermediate vocabulary to be 
developed for describing the central nervous system. Hitherto, only one non-trivial 
"machine-code" operation has been studied in the context of neural hardware, namely simple 
storage and retrieval functions (Marr 1969 & 1971, and Brindley 1970). 
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